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On the Feasibility of Inter-domain Routing via a 

Small Broker Set 


Abstract — The current inter-domain routing protocol, namely, 
the Border Gateway Protocol (BGP), cannot provide end-to-end 
(E2E) quality-of-service (QoS) guarantees. The main reason is 
that an autonomous system (AS) can only receive guarantees 
from its first-hop ASes via service level agreements (SLAs). But 
beyond the first-hop, QoS along the path from a source AS to a 
destination AS is not within the source AS’s control regime. This 
makes it difficult to provide high quality-of-experience services 
to many Internet users even when many content providers are 
willing to pay for such high quality E2E guarantees. In this 
paper, we investigate the feasibility of having a (small) set of 
ASes (or IXPs) to serve as “brokers” to provide high QoS- 
guaranteed E2E transit services. Finding an optimal set of ASes 
as brokers can be formulated as a Maximum Coverage with 
B— dominating path Guarantee (MCBG) problem, and we show 
that it is in fact NP-hard. To address this problem, we design a 
— )— approximation algorithm and also an efficient heuristic 
algorithm when additional constraints (e.g., path length) are 
considered. Based on the current Internet topology, we discover 
that a “3540-alliance” subset of ASes/IXPs can serve as an ideal 
broker set. By studying the collected data from 52,079 ASes/IXPs, 
we demonstrate that it is indeed feasible to provide high QoS 
guarantees for 99.29% E2E connections with only a small subset 
(or 6.8%) of ASes/IXPs serving as brokers. Finally, we provide 
an economic model to show that there are incentives to form and 
maintain such a brokerage coalition. 


I. Introduction 

The Internet is a global system of interconnected computer 
networks consisting of tens of thousands of autonomous 
systems (ASes) [1], With the explosion of Internet traffic, 
end-to-end (E2E) Quality-of-Service (QoS) guarantees, which 
impose a stringent inter-AS QoS support, are becoming more 
and more important. By 2020, global IP traffic will reach 1.3 
ZB per year, in which 82% is IP video traffic [2], and E2E QoS 
guarantees for such applications are urgently needed. However, 
E2E QoS cannot be guaranteed by the current inter-domain 
routing protocol, namely, the Border Gateway Protocol (BGP). 
The main reason is that an AS can only receive guarantees 
from its first-hop ASes via service level agreements (SLAs). 
But beyond the first-hop, QoS guarantee along the path from 
a source AS to a destination AS is beyond the source AS’s 
control regime. From our collected data of 52,079 ASes or 
Internet eXchange Points (IXPs), it reveals that more than 90% 
of E2E AS connections are more than one-hop. 

To address this issue, content providers typically use content 
delivery networks (CDNs) to distribute their contents around 
the world so that most requests can be served by nearby 
copies stored by CDNs. However, the CDN technology is 
not effective for realtime and delay sensitive services such 
as VoIP and video conferencing because for most of these 
applications, the E2E AS hop count is usually larger than one 


and unfortunately, there is no inter-AS QoS support in the 
current Internet. 

To address this issue, we present a novel and easily imple- 
mentable proposal: By utilizing a “small broker set” to stitch 
each inter-AS hop along the AS routing path, not only one can 
provide supervision, control and resource negotiation so that 
the network performance can be quantified, furthermore, the 
ASes’ E2E QoS can be greatly improved. The broker set is 
formed by a small subset of ASes or IXPs, which are selected 
to serve as inter-AS routing brokerage agencies so as to take 
up the responsibilities of network performance measurement, 
control, resource negotiation, as well as to provide transit 
services. When an AS-path can be covered by the broker 
set (i.e., for every AS hop, at least one of its source or 
destination belongs to the broker set), this AS path is said to 
be dominated by the broker set. We would also like to point 
out that the realization of the AS broker set can benefit both 
CDNs and data center networks (DCNs) because the broker 
set can enhance the service QoS while reducing the number 
of needed servers. 

The technical challenge is how to efficiently find such a 
broker set that provides dominating paths for most inter-AS 
connections in the current Internet. To address such challenge, 
we have to consider the following: 

• Which AS/IXP should be in the broker set, which we 
denote as B? 

• How small can B be such that it can provide 
B— dominating paths for most, if not all, inter-AS con- 
nections? 

• Is there any economic incentive to form and maintain 
such kind of broker set? 

Contributions: In this paper, we introduce an inter-AS 
routing framework where a broker set is selected to im- 
prove the ASes’ E2E QoS by dominating the associated 
AS paths. We model the broker set selection problem as 
the Maximum Coverage with B— dominating path Guarantee 
(MCBG) problem and prove its NP-hardness. We further 
propose an approximation algorithm which can provide at least 
— )— guarantee as compared to the best E2E connectivity 
with the dominating AS paths, given the size constraint of B. 
We then propose a heuristic algorithm when additional factors 
(e.g., path length constraints) are considered. The algorithm is 
not only computationally efficient, but also offers a minimal 
reduction of the QoS guarantees of no more than 0.5% E2E 
AS connections. By studying our collected data from 52,079 
ASes/IXPs, we demonstrate that it is indeed feasible to provide 
QoS guarantees for 99.29% E2E AS connections with only 
a small subset of (around 6.8%) ASes and IXPs serving as 
brokers. Finally, we provide an economic model to show that 
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there are incentives in forming and maintaining such brokerage 
coalition. 

The rest of the paper is organized as follows. In Section II, 
we present the related work. In Section III, we describe our 
collected datasets, and provide interpretation of the topology. 
In Section IV, we present our problem statement and the 
approximation algorithm. In Section V, we further consider 
two practical issues such as lower computational complexity 
and the path length constraint, and propose an efficient heuris- 
tic algorithm. In Section VI, we present experimental results 
and findings to demonstrate the feasibility of an inter-domain 
routing using a small broker set. In Section VII, we present the 
economic feasibility that can potentially makes this inter-AS 
routing scheme a reality. Finally, Section VIII concludes. 

II. Related Work 

In this section, we present previous proposals which attempt 
to provide E2E QoS. We classify them in three categories and 
describe their limitations. 

Computing QoS-constrained path. To provide E2E QoS, 
many works focus on finding paths satisfying the QoS con- 
straints. Authors in [3] extend the weighted shortest path intra- 
domain routing based on exchanging link state information, 
and propose an algorithm based on probing packets flowing 
through the Internet so to collect routing information. In [4], 
authors consider one pre-determined route and construct a 
subgraph containing the known route and all its neighbors. 
The connections between AS border routers are extracted as a 
graph whose link weights are assumed to be known. Then it 
uses existing routing algorithms to find QoS-constrained path. 
Authors in [5] propose a distributed solution for calculating 
QoS-constrained path over multiple pre-determined routes. 
The performance of [3] highly depends on the obtained infor- 
mation about routing cost in each domain. Note that collecting 
those information is costly and has the scalability problem. 
The performances of [4], [5] depend on the predetermined 
routes, and they all assume the link weights are known, which 
are not realistic. 

Stitching QoS-enable pathlets. In [6], [7], authors propose 
to use central control points (CXPs) to stitch the QoS enabled 
pathlets (fragments of paths represented as sequences of a 
virtual node) provided by ISPs to construct global paths. A 
CXP is external to an ISP entity and it applies centralized 
inter-domain control over how fractions of Internet traffic are 
routed. These schemes seriously increase the burden of CXPs 
since they need to exchange Internet traffic for ASes and also 
calculate optimal QoS E2E paths for all the routing requests. 
Economic method. Authors in [8] seek to develop an eco- 
nomic plane solution for E2E QoS. ChoiceNet [8] introduces 
marketplaces in which providers and users can meet and sup- 
ply the minimally necessary semantics for them to exchange 
information. Yet, the actual holder of a marketplace is still 
unclear, and providers often do not have the detailed quality 
information of different AS routes for all end users. 

Finally, all works listed above have the scalability problem 
when facing a large scale network such as the Internet. In 
the dataset we collected, there are tens of thousands of ASes. 


In this work, we aim to improve the E2E QoS by finding 
a small set of ASes to stitch each two AS hops along the 
AS routing path to provide supervision, control and resource 
negotiation for each AS hops. One attractive property of our 
approach is that, if all connections are bidirectional, we can 
achieve QoS guarantees for 99.29% E2E connectivity with 
only 6.8% ASes/IXPs as brokers; if considering the directional 
connections, we just need a minor change to the current AS 
peering relationships to serve 72.5% E2E connectivity with 
quality assurance with only 2% ASes/IXPs as brokers. 

III. Topology and Datasets 

Since the Internet topology heavily influences how one 
should model and design an effective inter-domain routing 
strategy with E2E QoS support, here, we first present our 
collected data, which include AS sources and their routes, and 
then we describe our data processing method. 

In this study, we consider the AS-level topology. The AS- 
level Internet ecosystem can be considered as a logical fabric 
of the Internet. AS-level topology, which is composed of 
different ASes and their interconnections, has been widely 
used to characterize the Internet traffic. There are basically 
two mechanisms to connect ASes. One is via dedicated links, 
which relies on the business agreement between two ASes, 
e.g., provider-to-customer peering or P2P peering. Another is 
to make use of a physical interconnection infrastructure called 
the Internet eXchange Point (IXP), which provides efficient 
and cost effective means for traffic exchange between multiple 
ASes. If an AS wants to enjoy this cheap traffic exchange, it 
has to register as a member of the corresponding IXP. We 
collected data for the AS topology as well as connections to 
IXPs. We then built a network topology to cover both direct 
and IXP-based connections. 

Currently, there are some excellent public Internet AS-level 
topology datasets. Here we adopt the dataset from [9], which 
offers the most comprehensive and long-term data. The AS 
topology is constructed using BGP data of IPv4 collected by 
Route views, RIPE RIS, PCH and Internet2 [9]. The data 
are stored on a monthly basis. To make a complete AS-level 
topology, we use the data of the whole year for 2014. In 
addition to the traditional AS topology, we also manage to 
discover those AS connections via IXPs. We obtained the data 
of IXPs membership and IXPs peering in 2014 using similar 
approaches described in [10]. 

It is important to point out that it is inevitable to have an 
incomplete AS topology. This is due to the limited scope of 
the BGP data collection method, e.g., some interconnections 
between ASes may not be discovered. Also, some short- 
life connections may be falsely presented, originating from 
unintentional misconfigurations or intentional trials [9], For 
the IXP data, there are around 400 IXPs which are providing 
global traffic switching services in 2014, and we were able to 
collect around 80% (or 322) of these IXPs based on targeted 
traceroute and targeted source routing techniques. Note that the 
large numbers of ASes, IXPs and connections, as illustrated 
in Table I, show that our dataset is indeed representative. 

Similar to [6], [7], we also assume that IXPs are indepen- 
dent entities. This is proposed due to the rich connectivity of 
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TABLE I 

Summary on the Collected Dataset 


Description 

Numbers 

IXPs 

322 

ASes 

51757 

Size of the maximum connected sub graph 

51,895 

# of Connections 

347,332 

# of Connections among ASes 

292,050 

# of Connections between IXPs and ASes 

55,282 


IXPs [6], [7], This assumption assigns a new role to IXPs 
which typically provide only switching service instead of 
routing. In our experiment, we will show the importance of 
IXPs, in particular, they play a critical role in the broker set. 
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Fig. 1. Visualization of an AS-level Internet topology, which is scale-free 
and layered network consists of IXPs at both its core and edge. 

Figure 1 depicts the visualization of our derived topology 
using the /.'-core decomposition method [11] to review the 
hierarchical structure of the network. Nodes in the figure 
represent either ASes (in color) or IXPs (in black). Edges 
in the figure represent either AS-IXP or AS-AS connections. 
k—core of a graph is defined as the maximal subgraph in 
which each vertex in this subgraph has at least degree k, then 
the “ coreness ” of a vertex is k if it is a vertex in a k— core 
subgraph but not a vertex of a (k + 1)— core subgraph. We 
assign colors according to the coreness of nodes, as illustrated 
by the right side of the figure, to reveal the hierarchical 
structure of the network. Our AS topology consists of more 
than 120 shells of k— cores (each shell is assigned a unique 
color). Furthermore, the figure also shows the degree of nodes. 
The scale of node degree is displayed using the size of a node, 
as illustrated in the left side, a node with the degree of 8,741 
is the largest node in the graph. 

IV. Problems and Algorithms: Theoretical Basis 

In this section, we formulate our inter-domain routing 
brokerage problem and develop some theoretical founda- 
tions. First, we formulate it as a Maximum Coverage with 
B — dominating path Guarantee (MCBG) problem and analyze 
its complexity; then we propose an approximation algorithm 
to solve the MCBG problem. 

A. Problem statement 

Let G = (V, E) denote an undirected graph consisting of the 
vertex set V and edge set E. For each vertex v £ V, we define 


the neighborhood N (v) as the set of all vertices in V that are 
adjacent to v. Similarly, define N ( V ') as the set of all vertices 
in V which are adjacent to \/v£V\ i.e., N (V I ) = U V £V'N (v). 
We first define the notion of a “B— dominating path”. 
Definition 1: Given a graph G = (V. E ), a routing path is 
called a “ B— dominating path” if for every hop along the 
path, at least one of its source or destination vertex belongs 
to the set B, where B C V. 

In the context of inter-domain routing brokerage, we treat 
the AS-level topology we mentioned in Section III as the 
input graph G, where vertex set V is the set of ASes/IXPs, 
a connection between AS/IXP u and AS/IXP v is represented 
by an edge {u, v) in G. If we can find a small set B C V 
such that for every source -destination AS pair, we can find a 
B— dominating path, then the E2E network performance can 
be maintained and managed. Furthermore, since B is small, 
it is easier to create economic incentives to form B such that 
the E2E QoS can be greatly improved. To this end, we aim to 
find a broker set B such that Vv,v' £ V, there exists at least 
one B— dominating path between them. 

Mathematically, the inter-domain routing brokerage problem 
can be formulated as a path-dominating set (PDS) problem. 
Path-Dominating Set (PDS) Problem: Given an input graph 
G = ( V. , E) and an integer k > 1, determine whether it is 
feasible to find a set B C V such that 

• \B\ < k, and 

• there exists at least one B— dominating path between u 
and v, for Vu, v £ V. 

Sometimes, it may not be possible to find a solution 
for the PDS problem to provide all connections with the 
B— dominating path guarantees. Nevertheless, we still want to 
find a small broker set B so as to provide as many connections 
with B— dominating path guarantees as possible. To this end, 
we formulate the optimization version for the inter-domain 
routing brokerage problem in Problem 1. 

Problem 1 Maximum Coverage with B-dominating path 
Guarantee (MCBG) problem 

Input: A connected non-trivial graph G = ( V.E), and a 
positive integer k. 

Output: A subset B C V which guarantees: 

1) \B\ < k; 

2) for Mu,v £ B U N (B), there exists at least one 
B— dominating path between u and v, 

3) / ( B ) = \B U N (B) | is maximized. 


Note that for Vu,v £ B\JN(B), if there exists a path con- 
taining only nodes in B U N(B), the path must be dominated 
by B. Thus the coverage function / can help to evaluate the 
satisfiability of the E2E connectivity with B — dominating path 
guarantee. Now, let us state our first result. 

Theorem 1: If there exists a solution for the PDS problem, 
then it is also the solution for the MCBG problem. If there is 
no solution for the PDS problem, the solution for the MCBG 
problem can provide dominating path guarantees to the largest 
possible source-destination pairs. 

Proof: If there is a solution to the PDS problem, denote it 
as B , which satisfies |B|<fe and can provide B — dominating 
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path guarantee for Vit, v£V . Thus both a and v must connect 
to at least one broker, i.e., B\J N(B)=V . Therefore, B is the 
solution for the MCBG problem. If there is no solution for 
the PDS problem, denote the solution to the MCBG problem 
as B. If there is a set B' which satisfies \B'\ < k and can 
provide B'— dominating path guarantee for Vit, v £ B' U V' 
and \B'CV'\ > \BUN(B)\. To satisfy the S'— dominating path 
constraint, any vertex in V' must connect to at least one broker 
in B', i.e., V’ C B'UN(B'). As \B'UN(B')\ > \B'UV'\ > 
\BJN(B)\, B is not the solution of problem 1. Therefore, there 
doesn’t exist such a set B'. So B can provide B- dominating 
path guarantees for as many connections as possible. I 

B. Computational complexity 

Let us now quantify the computational complexity of the 
PDS problem. 

Lemma 1: The PDS problem is NP-complete. 

Proof: One can prove this by reducing the vertex cover 
problem to the PDS problem in polynomial time. Due to page 
limit, we leave the detailed proof in the Appendix. I 

To analyze the computational complexity of the MCBG 
problem 1, we first consider its decision version. 

Lemma 2: The decision version of the MCBG problem is NP- 
complete. 

Proof: The decision version of MCBG will output an indicator 
“YES” if it is feasible to find a subset B C V which satisfies 
the three constraints in problem 1, and “NO” otherwise. It is 
easy to show that the decision version of MCBG is NP. We can 
show that there is a polynomial-time reduction from the NP- 
complete PDS problem (which is proven to be NP-complete in 
Lemma 1), to the decision version of MCBG problem. Based 
on their definitions, for each instance in the path-dominating 
set B, we can construct an instance of the decision version of 
MCBG by setting p = \ V\. Therefore, the decision version of 
the MCBG problem is also NP-complete. I 

Theorem 2; Problem 1, the MCBG problem, is NP-hard. 
Proof: As the maximization problem is NP-hard if its decision 
version is NP-complete, the MCBG problem is NP-hard. I 

C. Approximation algorithm for MCBG 

Given that the MCBG problem is NP-hard, we propose 
an approximation algorithm to solve the MCBG problem. 
The high level idea is that, we divide the broker set B into 
two parts: B* , pre-selected for approximating the optimal 
coverage, and B 1 , added for guaranteeing the B dominating 
path constraint. 

To find B* , we define the Maximum Coverage with broker 
set B (MCB) problem, and then present its approximation 
algorithm Alg. 1. The selection of B* is realized by Alg. 1. 

For convenience, let MCB(V,k) and MCBG (V,k) de- 
note an instance of MCB and MCBG problem respectively. 
We can use the following approximation algorithm to solve 
MCB ( V , k), in other words, to find B*. 

Now the remaining issue is to find B'. To achieve this, 
we take advantage of the special property of our graph. Note 
that for the AS-level Internet graph we study (e.g., the graph 
depicted in Figure 1), it has a special characteristic, that is. 


Problem 2 Maximum Coverage with broker set B (MCB) 
problem 

Input: A connected non-trivial graph G = (V.E) and a 
positive integer k. 

Output: A subset B C V which guarantees: 

1) \B\ < t, 

2) / ( B ) = \B U N (B) | is maximized. 


Algorithm 1 Approximation algorithm for MCB (V, k) [12] 
Input: The vertex set V and an integer k 
Output: A set B which satisfies \B\ < k 
l: Start with B 0 = 0; 

2: for i = 1 to k do 

3: Si £- arg max s f i U {s}) - / (£*_ i); 

4: Bi 4— Bi_ 1 U {s.j}; 

5 : end for 

6: Return B -- If,. 


more than 99.2% of the source and destination pairs’ hop count 
distances are within 4 hops. This special characteristic helps 
us to design efficient brokerage algorithm. Let us first formally 
define this characteristics. 

Definition 2; A graph G = {V, E) is called an ( a , /?)— graph 
if the following condition is satisfied 


Prob[d(u, v) < (3\ > a \/u,v £ V, 


where d(u , v) is the shortest hop distance between node u and 
v, f3 is an integer which is much smaller than the diameter of 
G and a £ [0.5, 1], 

For example, the AS-level graph we have is a (0.99, 4)— graph. 
Note that the property of (a, fi )— graph can help us to decide 
the size of B' to satisfy the B— dominating path constraint. 
The details about how to solve the MCBG problem, including 
finding B* and B', is shown in the following approximation 
algorithm Alg. 2. 

In Alg. 2, the computational complexities for the selection 


of B* and B' are 0(fc(|V| + [-E])) and 0(k 2 (\V\ log |V| + 
[£])), when adopting the Fibonacci heap implementation of 
the Dijkstra’s algorithm for calculating the shortest path in 
line 5 of Alg. 2, respectively. 

Now we can prove how Alg. 2 can achieve the approxima- 
tion with the pre-selected B*. Let us first present the following 
lemmas to aid the proof. 

Lemma 3: The coverage function f is a submodular and 
nondecreasing set function [13]. 

Lemma 4: Alg. 1 provides (1 — e -1 )— approximation for the 
Maximum Coverage with broker set B (MCB) problem [12], 
Lemma 5: A tree with k vertices can be divided into no more 


than p = 


2(fc — 1) 


+ 1 subtrees in which each subtree has no 


more than m vertices. 

Proof: We design a tree partition algorithm to realize this 
partitioning. Due to the page limit, we leave the detailed proof 
and the proposed tree partition algorithm in the Appendix. I 
Now we are in the position to state the following theorem. 
Theorem 3: When a graph is a (a, ft)— graph, we can obtain 
an approximation algorithm for the MCBG problem with the 



5 


Algorithm 2 Approximation algorithm for MCBG{V,k) on 
a (a, /?) — graph G 

Input: The vertex set V and an integer k 
Output: A set B which satisfies \B\ < k and guarantees at 
least one .B— dominating path between \/u, v G BUN ( B ) 


1 : B* = the solution returned by applying Alg. 1 to 


MCB ( V , x*) and B' = V-B*, where x* = 


k- 1 

m 


+i 


2 

3 

4 

5 

6 


7 

8 
9 

10 

11 

12 


for all r £ B* do 

B' r = 0; 

for all v € B* — {? } do 

Find the shortest path from v to r on G ( V , E); 


Add at most | f | — 1 members along the path to B' r 
to guarantee this path is a ( B * U B ' r ) —dominating 
path (every two adjacent brokers are 1 hop neighbor 
or 2 hop neighbor connected by a non-broker); 

end for 


if \B' r \ < \B'\ then 
B' = B' r - 

end if 
end for 

Return B = B* U B' . 


approximation ratio of 


l—e~ 


such that: 


9 = p = 


p,p is even; 

/3 + 1, /3 is odd. 


( 1 ) 


Proof: Let OPT x » denote the optimal solution for 
MCBG (V,x*), OPTk denote the optimal solution for 
MCBG (V,k), B denote the solution for MCBG (V,k) 
obtained through the Alg. 2. We are trying to prove that: 


Oe 

e — 1 


f(B)>6f ( OPT x .) > / (OPT k ) . 


( 2 ) 


First we will prove Of { OPT x .) > / ( OPT *.). 

As Vu, w £ OPTk U N {OPTk), there exists a 
OP'!),- dominating path between u,w. We will redefine the 
connectivity in OPTk . For any two vertices in the broker set, 
if they are 1 hop neighbors or they are connected by a non- 
broker vertex, we consider these two vertices are connected. 
Thus we have a new defined connected graph of OPTk. Since 
every connected graph has a spanning tree, we can construct 
a tree with size k according to the connected graph of OPTk. 

Based on the result of lemma 5, we know that this tree T 
constructed from the connected graph of OPTk can be divided 


into p= 


2(fc~l) 

m 


+1 subtrees whose sizes are no more than m. 


Here m = x* = 


-4-1 

r§i + 


> to satisfy x*+{x* — 1)( 

I 2 | I 

1) < k, such that after the operation in line 6 of Alg. 2, the 
size of broker set will not exceed the size constraint k. Thus, 




P = 

Denote those p subtrees as 7’,. 1 < i < p and the vertices 
in T, as N,. Based on the property of function / mentioned 


in lemma 3, we have: 

f {OPTk) = f (ULi Ni) < ur=i / m m 

<pf{OPT m ) = 9f{OPT x .) 

Next, we will prove / {B) > f ( OPT x .). 

Denote the optimal solution of MCB {V,x*) as OPT'. 
We have f{OPT') > f{OPT x ») for OPT' does not have 
the OPT'— dominating path constraint. From lemma 4, we 
have f {B) > / {OPT'). And thus we have proved 

(s?T )f(B)>f{OPT x .). ■ 

As for the AS-level Internet topology, 99.2% E2E connec- 
tions are within 4 hops. 

Corollary 1: Given that our AS-level topology is a 

(0.99,4)— graph, Alg. 2 is a approximation algorithm 

for the MCBG problem. 

V. Problems and Algorithms: Practical Considerations 

The previous section provides the theoretical foundation of 
the broker set selection problem. To address the needs of the 
inter-domain E2E QoS guarantee, we have to consider several 
engineering and practical issues. First, to further improve 
the computation efficiency of the approximation algorithm, 
we propose a heuristic algorithm with lower computational 
complexity while maintaining a good B— dominating path 
coverage with broker set B. Second, we consider a more 
general version of the MCBG problem in practice by taking 
the path length constraint into consideration. 

A. Efficient heuristic algorithm and baseline algorithms 

The MaxSubGraph-Greedy algorithm, as depicted in Alg. 3, 
is a pseudocode of an effective and computation algorithm 
for broker set selection. It has a computational complexity of 
0{k(\V\ + \E\)) while maintaining a good B— dominating path 
coverage with broker set B. 


Algorithm 3 MaxSubGraph-Greedy 

Input: A connected non-trivial graph G = {V, E) and a 
positive integer k 

Output: A set B which satisfies \B\ < k 
1: Select a vertex v £ V, and let B = {v}\ 

2: If \B\ = k or V-{BU N (S)) = 0 , then Stop; 

3: Select a vertex w £ V — B, and assign B <- 5 U {w} if 
the size of maximum sub graph in _BU{t(;} is maximized. 
Go to Step 2. 


Note that Alg. 3 aims to maximize the connected graph 
size in each iteration. As we will show, our experiment results 
indicate that Alg. 3 is capable of finding a broker set with 
very high coverage in only few thousand iterations. 

To compare the performance gain of our proposed algo- 
rithm, we compare it with four baseline algorithms, whose 
detailed pseudocodes are listed in the appendix. The Set 
Cover (SC) algorithm is an algorithm proposed in [14] to 
find some but not necessarily the smallest dominating sets. 
We compare with this algorithm to help us to gain some 
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understanding on the importance of a broker set selection 
process. The IXP-Based (IXPB) algorithm returns a set of 
IXPs with a degree higher than a given threshold. Since IXP is 
often treated as an ideal node for inter-domain control [6], [7], 
it is important for us to understand the influence of an IXP if it 
is used as a broker. The Degree-Based (DB) and PageRank- 
Based (PRB) algorithms are greedy algorithms widely used 
in identifying important vertices in a graph. At each round, 
the node with the largest degree or page rank value will be 
added to the broker set. In later experiments, we shall further 
explore their differences in selecting a broker set and examine 
their performance against each others. 

B. Path length constraint and its evaluation method 

Note that in the MCBG problem, for each source and 
destination pair (u, v), the B — dominating path can be of 
arbitrary length. For some ISPs, they may want to restrict the 
number of AS hop counts on the E2E path, e.g., the number of 
AS hop counts should follow some ISPs’ specified distribution. 
Therefore, during the search of the broker set B, we introduce 
an extra requirement on the path length l uv . As a result, we 
further refine the MCBG problem, and we call it MCBG with 
Path Length Constraint, which can be stated as follows. 


Problem 3 MCBG problem with Path Length Constraint 
Input: A connected non-trivial graph G=(V,E), a positive 
integer k and positive integers l uv , representing the path 
length parameter for any pair u, v € V («/ v). 

Output: A subset B C V which guarantees: 

1) \B\ < t, 

2) for Wu,v £ B U N (B), there exists at least one 
B — dominating path of length l uv between a and v; 

3) / ( B ) = \B U N (B)\ is maximized. 


For the AS hop-count to follow a specified probability dis- 
tribution, one can use the following probability interpretation. 
If the choice of a source u and destination v pair can be viewed 
as a random event with every possible source-destination pair 
as the sample space, then the corresponding path length l uv can 
be viewed as a random variable l. If every event of the sample 
space is equally probable, then the normalized histogram of 
l uv would give the probability mass function of the random 
variable l and hence the distribution function of l can also be 
deduced. 

Here, we say a selection strategy is feasible if it gives 
a candidate solution that satisfies the specified path length 
distribution up to e fraction of error for each value of l. 
Mathematically, for a given distribution based path character- 
ization F(l) (where the distribution F(l) is written in terms 
of the cumulative path-length histogram, i.e., the number of 
admissible path with path-length less than or equal to l), an 
algorithm A of producing a broker set Bj, is called feasible 
if the produced set Bj, by the algorithm A give a distribution 
Fb a (1) which deviated from the required F(l) by at most e, 
for all values of l, i.e., 

\Fb a (1) — F(l)\ < e, V/. (4) 


Using Equation (4), to verify the feasibility of a candidate 
algorithm A, we need to be able to compute the cumulative 
distribution Fb a {I uv ) for a broker set B a produced by the 
candidate algorithm A. Hence, let us define a Ba operator 
on the adjacency matrix A of a graph G as Ba ■ A. This 
operation will erase an entry of A whenever neither of its 
row nor column indices belongs to Ba- Let us call the output 
of Ba ■ A as A. This matrix can give the desired cumulative 
Ba — dominating path length distribution Fb a (1) in the follow 
manner: the number of nonzero entries in A 1 gives the number 
of Ba ~ dominating paths with length less than or equal to l. 
Therefore, we call this as the “(—hop E2E connectivity”. 

Tab. II depicts the /—hop connectivity of different topolo- 
gies (e.g., ER-Random, WS-Small-World, BA-Scale-free, ASes 
with/without IXPs) under such evaluation metric. Here ASes 
with/without IXPs are the AS level topologies used in this pa- 
per with/without considering IXPs as independent entities. The 
other topologies, ER-Random, WS-Small-World and BA-Scale- 
free, have the same vertex sets (including 52,079 ASes/IXPs) 
with ASes with IXPs, but the edge sets are generated according 
to the topologies’ features accordingly. Note that for ASes with 
IXPs, if we set / = 4, we have 99.21% E2E connectivity. 

TABLE II 

l — HOP CONNECTIVITY OF DIFFERENT TOPOLOGIES. 


hop count 

1 

2 

3 

4 

5 

6 

7 

ES 

0.37 

4.91 

47.47 

99.30 

99.69 

99.69 

99.69 

WS 

0.24 

2.28 

18.76 

83.23 

99.69 

99.69 

99.69 

BA 

1.11 

26.17 

95.50 

99.69 

99.69 

99.69 

99.69 

ASes w/ IXPs 

10.00 

65.74 

96.65 

99.21 

99.29 

99.29 

99.29 

ASes w/o IXPs 

5.39 

47.98 

90.02 

97.35 

98.00 

98.06 

98.06 


VI. Structural Feasibility and Broker Set’s Properties 

In this section, we describe some experimental results 
showing the possible composition of a broker set in the current 
Internet. We also compare our proposed algorithm on broker 
selection with other state-of-the-art algorithms. Since space is 
limited, we leave some detailed results in the appendix. 

A. Evaluation for l— hop E2E connectivity 

The selection of a broker set is non-trivial. Improper selec- 
tion may lead to a broker set with large size, making it more 
difficult to incentivize ASes to join the broker set, or with very 
poor /—hop E2E connectivity. Fig. 2(a) illustrates the cumula- 
tive distribution function (CDF) of the AS E2E connectivity by 
running the SC algorithm over 300 iterations. Although 100% 
E2E connectivity is guaranteed, the SC algorithm has to select 
around 40,000 nodes in the broker set, accounting for more 
than 76% of the overall network vertices. Fig. 2(b) illustrates 
the achieved /—hop E2E connectivity by varying hop count 
requirement / of the other algorithms. The results of IXPB 
and TierlOnly algorithms, which were considered in previous 
works, show that it is not appropriate to merely rely on the 
IXPs or the tier 1 ISPs to act as brokers. Both of them suffers 
from low E2E connectivity: IXPB algorithm could only reach 
at most 15.70% E2E connectivity with 322 brokers, and it is 
far worse for the TierlOnly algorithm. 
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(a) CDF of broker set size by SC (b) /—hop E2E connectivity of other 
algorithm algorithms 


Fig. 2. Comparison for l — hop E2E connectivity of different algorithms. 

The DB and PRB algorithms, they can lead to serious 
marginal effect: the marginal increase of the l — hop E2E 
connectivity decreases with the increasing broker set size. 
This can be caused by the decreasing correlation between 
degree/PageRank value and saturated E2E connectivity with 
the increasing broker set size. The broker set selected by 
the DB algorithm, which consists of high degree ASes and 
IXPs, can achieve around 72.53% E2E connectivity with 1,005 
brokers. However, the DB algorithm requires a large size 
broker set to guarantee a high (e.g., 99%) E2E connectivity 
for the serious marginal effect when \B\ > 1,000: the DB 
algorithm can only achieve 96.35% E2E connectivity even 
with 9,618 brokers. Fig. 3(a) depicts a more detail view of the 
selected broker set, in which most selected brokers are located 
at the center network core, leaving network edge mostly 
uncovered. The PRB algorithm has similar problem, for the 
PageRank distribution of the undirected graph is statistically 
close to its degree distribution [15]. 

Our approximation algorithm for the MCBG problem can 
achieve 85.71% saturated connectivity with 1,064 brokers and 
99.29% saturated connectivity with 3,688 brokers, making it 
the best algorithms among all we considered. Compared with 
the approximation algorithm, our MaxSG algorithm achieves 
equivalent performance (i.e., sacrificing less than 0.5% con- 
nectivity) while greatly reduces the computational complexity. 
Also, MaxSG algorithm outputs a broker set consisted with 
3,540 members which totally dominate the maximum connect- 
ed sub graph of the given Internet topology, i.e., 51,895 out 
of 52,079 ASes/IXPs, leading to a saturated E2E connectivity 
as high as 99.29%. Finally, Fig. 3(b) presents a close look of 
the selected broker set. Unlike the DB algorithm, our MaxSG 
algorithm does not have an overcrowded network core and the 
network outer ring can be well covered. In the following, we 
discuss some attractive properties of this 3540-alliance broker 
set. 

B. Attractive Properties of the 3540-alliance broker set B 

We name the broker set with 3,540 brokers output by the 
MaxSG algorithm as the “3540-alliance”, and discuss some of 
its attractive properties. 

Minimal Path Inflations: Path length inflations (i.e., previ- 
ously l hops reachable pairs now require l' hops, where V>1) 
are observed in Fig. 2(b). Consider the DB algorithm, with 



(a) Degree-based (b) MaxSG 


Fig. 3. A close look at the brokers selected by PRB and MaxSG algorithms. 
Vertices in black are the brokers. 

1,005 brokers, only 72.40% E2E connection can be satisfied 
within 4 hops, in contrast to that of 90.02% in a free-path 
selection scheme (i.e., denoted as “ASeswithIXPs”). While 
as illustrated in Tab. Ill, if the internal connections inside 
such broker set are bidirectional (i.e., there exist peering 
connections), minimal path inflations via this broker set can 
be achieved (i.e., the E2E connectivity curve of 3540-alliance 
almost overlaps the one of “ASesWithIXPs”). 

Diversified Compositions: As illustrated in Fig. 4(a), the 
3540-alliance consists of different types of ASes and IXPs, 
this avoids monopoly by some tier 1 ISPs. Here, we use the 
same definition and data published at [16] to divide the brokers 
into different categories according to the services they offered. 
Table IV lists some brokers and their rankings as well, which 
also illustrates the importance of IXPs for B — dominating path 
routing with broker set B. 

table in 

The 3540-alliance can guarantee minimal path inflations. 


hop count 

1 

2 

3 

4 

5 

6 

7 

ASes w/o IXPs 

5.39 

47.98 

90.02 

97.35 

98.00 

98.06 

98.06 

ASes w/ IXPs 

10.00 

65.74 

96.65 

99.21 

99.29 

99.29 

99.29 

MaxSG 3540 

9.96 

64.53 

96.09 

99.17 

99.29 

99.29 

99.29 


90% of the E2E Connections Only Used Nodes in the 
Broker Set: As illustrated in Fig. 4(a), although for some 
connections a re-route through non-brokers is still necessary, 
more than 90% E2E connections can be carried out the 3540- 
alliance solely without the aid of non-brokers, which implies 
that the broker set does not need to pay any non-broker 
node (AS or IXP) to complete the traffic transmission. For 
the remaining 10% E2E connections, we will show how to 
incorporate non-broker nodes in Section VII. 

Minimal Changes in the Business Relationships: While for 
real-life inter-domain routing, the business relationships (e.g. 
high-tier and low-tier, or peering) among ASes/IXPs has a 
significant influence and must be taken into consideration. 
Fig. 4(c) shows the actual performance of a broker set in the 
current Internet by forcing them to obey existing business rela- 
tionships probed, i.e., previously assumed bidirectional routing 
policy becomes directional. A sharply decreased E2E connec- 
tivity over different size of broker sets has been observed. 
However, we also notice in Fig. 4(b) that by changing only 
30% inter-broker connections to bidirectional (e.g., peering), 
such degradation can be greatly suppressed. Even a 1000- 
broker set with 30% change at its inter-broker connections 




(a) E2E connectivity via broker set (b) E2E connectivity by changing 
solely & Detailed composition 30% business relationship 
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MaxSG 1000 Core (realistic) 
MaxSG 2000 Core (realistic) 
-•-MaxSG 3000 Core (realistic) 
MaxSG 3540 Core (realistic) 


4 

Hop count 


(c) E2E connectivity with realistic 
business relationship 



Fig. 4. Findings for the 3540-alliance broker set found by the MaxSG algorithm. 


TABLE IV 
Broker List 


Rank 

Type 

Name 

Rank 

Type 

Name 

1 

IXP 

Equinix Palo Alto 

8 

T/A 

TWTC 

2 

T/A 

LVLT-3549 

9 

IXP 

Equinix Chicago 

3 

T/A 

COGENT- 174 

232 

C 

YAHOO- 1 

4 

IXP 

LINX 

260 

c 

ViaWest 

5 

T/A 

ATT-INTERNET4 

380 

c 

Host Virtual, Inc 

6 

T/A 

HURRICANE 

438 

E 

PE Voronov Evgen Sergi 

7 

IXP 

DE-CIX Frankfurt 

470 

E 

Serverius Holding 


1XP: Internet Exchange Point. 

Transit/ Access(T/A): ASes which serve as either transit and/or access provider. 
Content: ASes which provide content hosting and distribution systems. 

Enterprise: Various organizations, universities and companies at the network edge 
that are mostly users, rather than providers of Internet access, transit or content. 


can achieve a 72.5% E2E connectivity, and the 3540-alliance 
with 30% change can achieve 84.68% E2E connectivity. 
Potential for Multi-path Routing: As shown in Fig. 4(d), 
given specific broker sets, we find that a non-broker node 
typically connects to more than 4 brokers on average, this 
means that one can consider the multi-path routing framework 
in AS information delivery to further improve the E2E QoS. 

VII. Economic Incentives 

Now let us discuss the economic feasibility for realizing the 
brokerage scheme. We formulate a game-theoretic framework 
to understand the interactions of ASes and guarantee both 
ASes in broker set B and non-broker set B are willing to 
follow this new routing rule. More specifically, we first treat all 
ASes in B as one identity, and use a Stackelberg game model 
and Nash bargaining solution to study the game between any 
AS in B and B. Then we state the rationale for treating all 
ASes in B as one identity, and discuss the revenue distribution 
among ASes in B. 

A. Interactions of non-broker & broker sets 

We start with the example of the business model illustrated 
in Fig. 5. All ASes in B are treated as one identity, and the 
rationale will be explained later. 

We first discuss the behaviors of ASes in B. Let i denote a 
specific AS in B, and it plays a double role in the game with 
B: (1) customer of B, e.g. AS 1, which has a routing strategy 
o:, to determine the fraction of AS i’s traffic routed to B\ (2) 
employee of B, e.g. AS 5, which is hired by B to transit 


traffic. Fig. 5 offers an instruction of the payment flow. Our 
goal is to guarantee the existing of a steady state of economic 
relationships among all ASes, and make a, — > 1, which means 
that our new routing scheme will be fully adopted, under the 
steady state. 



Fig. 5. A typical example for the business model. 


Interactions between employee AS in B and B. We first 
consider when AS £ B acts as employee of B. Let pn denote 
the routing price of per unit volume traffic charged by B. As 
show in Fig. 5, B can charge from both the customer AS and 
the destination. Sometimes, E2E connectivity can be achieved 
via B solely, e.g., AS 2 and 3 has a connection. When there 
is no direct connection, B will need to hire a non-broker to 
form the B— dominating path, e.g., B hires AS 5 to achieve 
the E2E connectivity, and pays the employee AS p e for per 
unit volume of routing traffic. The cost for each AS to route 
per unit volume of traffic is denoted as c. We apply the Nash 
bargaining solution to achieve an agreement on the price p e 
between employee AS j and B. 

Let u* denote the utility function of employee AS j for per 
unit volume traffic: 


u i =p e -c. 


(5) 


Note that AS j does not have the global knowledge, it does 
not know the exactly hop count of the whole B— dominating 
path. On the other hand, j knows the hop count distance 
between any two vertices is no more than /?, it can therefore 
assume that for this B— dominating path, B needs to hire at 


most 


is at least: 


J employees, and B’s utility for per unit volume traffic 


ul = 2 p B - 


p 

2 


Pe ~ 




(6) 
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Thus the Nash bargaining solution is the solution of opti- 
mization problem: 

max Pe u* ■ u* b , 

s.t. p e > c (7) 

Theorem 4: For the bargaining proposed above, there exists 
the Nash bargaining solution. 

Proof: As the above optimization is maximizing a continuous 
function over a compact set, it must have at least one solution. 

■ 

Interactions between customer AS in B and B. Now 

we consider when an AS £ B acts as a customer of B. For 
any AS i, when acting as a customer, we normalize its total 
traffic volume as 1 . Let qq denote the fraction of traffic routed 
to B in the traditional routing mechanism. Thus a, = 1 and 
= «o represent the cases that the new routing scheme is 
fully adopted or rejected. Let Ui denote the utility function of 
AS i: 

Ui = Vi ( on ) + Pi ( at ) - p B oti, ( 8 ) 

where Vjfo:,) denotes the income charged from end-users, 
Pi(oti) and p/jO:, denote the cost for routing to B and B 
respectively. 

We assume Vi (aj) is proportional to the user’s satisfaction 
level on the QoS. Hence, when a, increases, more traffic is 
routed through B, so the QoS improves. User’s satisfaction 
will increase with the QoS, but the rate of increase will 
decrease due to the law of diminishing returns. Thus V](a,) 
is a continuous, concave and strictly increasing function. The 
value of Pi(a.i) can be positive, negative, or zero if i is a high- 
tier, lower-tier, or peering with other ASes in B. When a,; 
increases, AS i will transfer its traffic to B in the descending 
order of corresponding income (e.g., first high paid, then low 
paid, peering, low charged, and high charged), Pi(ai) is a 
continuous and concave function which is non-decreasing in 
[ao, a*] and non-increasing in [a*, 1], P;(l) = 0. 

The utility of B is: 

u b = 2p B a - C(a,p e ), (9) 

where a = a i v i< anc * C(a,p e ) is the cost function for 

routing data and hiring employees, assumed to be concavely 
increasing in a and p e . 

We model the interactions between AS i £ B and B 
as a Stackelberg game [17], where players make decisions 
sequentially. In particular, we have: 

Players: All ASes in B and B. 

Strategies: Vi £ B determines its routing strategy a, ; B 
determines the routing price p B . 

Roles: B is the first mover and decides p B . ASes in B are the 
second movers and decide aj,i £ B. Each AS i £ B makes 
its decision individually. 

Outcome: The outcome can be determined by backward in- 
duction, i.e., for any given p B , each AS i decides a* = ai(p B ) 
to maximize its own utility; based on this knowledge, B 
decides p B to maximize its utility. 


Theorem 5: For the Stackelberg game proposed above, there 
exists the Stackelberg equilibrium. 

Proof: We first focus on the second stage, e.g., AS i deter- 
mines on for given p B . We argue that for any given p B , the 
optimization problem 

max Qi Ui(ai) = + P^af) - p B 

s.t. ao < cti < 1 (10) 

has a unique solution. 

The reason is straightforward: Ui(ai) is a strictly concave 
function of and [ao, 1] is a convex set. Thus, the solution 

ai{p B ) = argrnax K,;(ctj) is unique. As ai(p B ) and p e (p B ) 
are continuous, so is a{p B ) = a i(Ps)- Thus, we have: 

max ps u b {p B ) = 2 p B a{p B ) - C{a{p B ),p e (p B )), 
s.t. 0 <p B <p B0 , (11) 

where p B o is the maximum price B can set. As the above 
optimization is maximizing a continuous function over a 
compact set, it must have at least one solution. This guarantees 
the existence of the Stackelberg equilibrium. I 

Theorem 4 and 5 guarantee the existence of a steady state 
economic relationship among all ASes. To further explore how 
to achieve a large value of o,:, we consider a simple example 
assuming homogeneous ASes Vi £ B. The result shows that, 
by including high-tier ISPs into the broker set, the lower-tier 
ISPs will be more willing to follow the new rule. 

B. Cooperation among ASes in the broker set 

So far, we regard all ASes in B as one identity and assumed 
they cooperate, e.g. provide most connections to guarantee 
connectivity and reduce path length. Now let us discuss how 
to achieve such cooperation. As cooperation in B can greatly 
improve E2E QoS, B can charge more from other ASes and 
end users. Our goal is to design a fair revenue distribution 
mechanism to guarantee that no AS in B has an incentive to 
leave the broker set. 

For this cooperative game, we apply the Shapley value 
approach [18] to capture the revenue distribution. Let U(B) 
denote B' s profit under the Stackelberg equilibrium. U ( B ) = 
u b {p* B ) if P* B is the price at the Stackelberg equilibrium. U(K) 
denotes set K's profit under equilibrium if KCB is selected 
as broker set. The marginal contribution of AS j f K to K is 
defined as: 

A, (K) = U(K U {j}) - U (. K ) . (12) 

Then, AS j’s Shapley value is: 

<Pj( B ) = j4jj (13) 

1 f ttgii 

where n is the set of all |P|! orderings of B and B(iT,j) is 
the set of ASes preceding j in the ordering of ir. 

We assume the revenue distribution of AS j £ B is equal 
to its Shapley value, for Shapley value satisfies the efficiency. 
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symmetry and fairness conditions [19]. Here, efficiency means 
the sum of Shapley value equals B's total profit. Symmetry 
means if two ASes contribute equally to any subset of B ex- 
cluding themselves, their Shapley values are the same. Fairness 
means for any two ASes in B, their mutual contributions are 
equal. Authors in [18], [20] provide methods to approximate 
the Shapley value in Eq. 13. 

Based on the property of Shapley value, we have the 
following theorems to achieve stable cooperation in B. 
Theorem 6: If \/K,L C B and K fl L = 0, U (KU If) > 
U(K) + U(L) (superadditivity), then the Shapley value is 
individually rational, i.e., tp : j (B)>U ({j}) ,Vj £ B. 

It is clear that the superadditivity condition satisfies in B, s- 
ince only fully cooperation over B can guarantee connectivity. 
This condition guarantee the stability of cooperation in B: no 
AS in B has an incentive to leave for it won’t achieve higher 
revenue by doing so. 

Theorem 7: If Vj € B and VK CLC B\ {j}, A, (K) < 
A j ( L ) ( supermodularity ), then the Shapley value satisfies 

E jeM ^(B)>U(M),\/MCB. 

If the supermodularity condition holds, it can guarantee 
strong stability of B: no subset of ASes has incentive to leave 
B and form another coalition, because their small coalition 
cannot bring more revenue. This finding can give insights on 
the proper size of B. At the beginning of the formation of B, 
supermodularity satisfies easily for the first added ASes are 
super ASes, and the “network externality” effect caused by 
the effect of cooperation. However, when the set size reaches a 
threshold, nearly all important ASes are already in B, thus new 
joiners have a marginal contribution, so the supermodularity 
condition does not hold any more. That’s the time to stop 
increasing the set size. 

VIII. Conclusion 

In this paper, we propose an inter-domain routing brokerage 
framework, in which an inter-AS routing path can be totally 
dominated by a small set of ASes and IXPs to provide E2E 
QoS guarantee. We model the problem as MCBG problem 
and prove it to be NP-hard. To address the MCBG problem, 
we propose a (i±| — )— approximation algorithm. To further 
improve the computation efficiency, we design a heuristic 
algorithm which, compared with the approximation algorithm, 
has equivalent good performance while greatly reduce the 
computational complexity from 0(k 2 (\V\ log | V | + |2£|)) to 
0(k(\V\ + |£?|)). We further investigate the feasibility of 
deploying the broker set in the current Internet from both 
structural and economic perspectives, and show that our pro- 
posed inter-domain routing framework is capable of providing 
enough incentive to persuade ASes to follow. We also show 
that with little change to the current AS peering relationships, 
72.5% E2E connectivity can be served with high quality 
assurance by selecting only 2% ASes/IXPs as brokers. 

References 

[1] L. Gao and F. Wang, “The extent of as path inflation by routing policies,” 
in Proceedings of IEEE GLOBECOM, Nov. 2002. 

[2] “Cisco visual networking index: Forecast & methodology, 2015-2020,” 
http://www.cisco.eom/c/en/us/solutions/collateral/service-provider/ 
visual- networking- index- vni/complete- white- paper- c 1 1 -48 1 360.html. 


[3] S. Norden, “Inter-domain routing: Algorithms for qos guarantees,” 
Computer Networks, vol. 49, no. 4, pp. 593-619, Nov. 2005. 

[4] R. Jacquet, G. Texier, and A. Blanc, “Sanp: An algorithm for selecting 
end-to-end paths with qos guarantees,” in Proceedings of IEEE Future 
Network and Mobile Summit, Jul. 2013. 

[5] N. B. Djarallah, N. L. Sauze et al., “Distributed e2e qos-based path com- 
putation algorithm over multiple inter-domain routes,” in Proceedings of 
IEEE 3PGCIC, Oct 2011. 

[6] V. K. X. Dimitropoulos, R. Kloti et al., “Control exchange points: 
Providing qos-enabled end-to-end services via sdn-based inter-domain 
routing orchestration,” Open Networking Summit, Mar. 2014. 

[7] V. Kotronis, M. Rost et al., “Stitching inter-domain paths over ixps,” 
ACM SOSR, 2016. 

[8] T. Wolf, J. Griffioen et al., “Choicenet: toward an economy plane for 
the internet,” ACM SIGCOMM Comput. Commun. Rev., vol. 44, no. 3, 
pp. 58-65, Jul. 2014. 

[9] “Data source,” http://irl.cs.ucla.edu/topology/. 

[10] B. Augustin, B. Krishnamurthy, and W. Willinger, “Ixps: mapped?” in 
Proceedings of ACM SIGCOMM, Aug. 2009. 

[11] J. I. Alvarez-Hamelin, L. Dall’Asta et al., “Large scale networks 
fingerprinting and visualization using the k-core decomposition,” in 
Advances in neural information processing systems (NIPS), 2005. 

[12] G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher, “An analysis of ap- 
proximations for maximizing submodular set functionsli ,” Mathematical 
Programming, vol. 14, no. 1, pp. 265-294, Dec. 1978. 

[13] “Submodular set function,” https://en.wikipedia.org/wiki/Submodular_ 
set_function. 

[14] A.-H. Esfahanian, “Connectivity algorithms,” http://www.cse.msu.edu/ 
, ~cse835/Papers/Graph_connectivity _revised.pdf. 

[15] F. S. Perra Nicola, “Spectral centrality measures in complex networks,” 
Physical Review E, Sept. 2008. 

[16] “As ranking,” http://as-rank.caida.org/. 

[17] M. J. Osborne, An introduction to game theory. Oxford University 
Press New York, 2004. 

[18] A. E. Roth, The Shapley value: essays in honor of Lloyd S. Shapley. 
Cambridge University Press, 1988. 

[19] R. B. Myerson, “Graphs and cooperation in games,” Mathematics of 
operations research, vol. 2, no. 3, pp. 225-229, 1977. 

[20] S. Maleki, L. Tran-Thanh et al., “Bounding the estimation error 
of sampling-based shapley value approximation,” arXiv preprint arX- 
iv: 1306.4265, 2013. 

Appendix 

Proof of Lemma 1 

Clearly PDS problem is in NP. Without loss of generality, 
assume that our graph does not have any isolated vertices, 
otherwise the output would simply be “NO”. We present a 
polynomial-time reduction from the Vertex Cover problem to 
the PDS problem. 

Given a graph G, we will construct a graph G 1 as follows. 
G' has all edges and vertices of G. Also, for every edge 
( u , v ) £ G , we add an intermediate node w in G'. Keeping 
( u , v ) intact in G', we add vertex w and edges ( u , w ) and 
( w , v) in G'. We can prove that G has a vertex cover of size 
k if and only if G' has a path-dominating set of the same size. 

Given B is a vertex cover in G, we now construct a 
B — dominating path for any two vertices if and v" £ G' to 
show that B is a path-dominating set for G'. We have the 
following cases: 

1) Both v' and v" are original vertices in G : Since G is 
connected, there is a path from if to v" . For every edge 
in the path, at least one of its end points is in B since 
B is a vertex cover. Thus this path is a B - dominating 
path. 

2) Either v' or v" is an additional node in G Suppose if is 
an additional node in G', we call its one hop neighbor in 
G as its proxy. In this case, v 1 has at least two proxies. 
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If v' is in G, its proxy can be itself. Thus both v' and 
v" must have their proxies. We denote the proxies of 
v' and v" as v s and Vd respectively. Since there is a 
B — dominating path between v s and Vd, we can extend 
it to the B — dominating path from v' to v" . 

If G' has a path-dominating set B of size k, then Vi/ £ G, 
either v' £ B or v' has a one hop neighbor in li (If not, v' 
has no B — dominating path to any other vertices in G). Thus 
B can serve as the set cover in G. And \/w £ B but w G, 
w must be connected to exactly two vertices u,v £ G. This 
is because the manner that we construct the graph G' . G' has 
all edges and vertices of G. Also, for every edge (it, v) £ G, 
we add an intermediate node w in G' . Keeping (u,v) intact 
in G ' , we add vertex w and edges (u, w) and (w, v) in G ' .) 
u, w, v form a triangle, we can then safely replace w £ B by 
u or v. The new set B can still be a set cover in G. Thus we 
have constructed a set cover in G with size k. 

Given both sides of equivalence, since the Vertex Cover 
problem is NP-complete, therefore the Path-Dominating Set 
problem is also NP-complete. 

Proof of Lemma 3 


Algorithm 4 can realize the partitioning of dividing a 
with size k (the number of vertices is k ) to p = 2 ^ 1 - ) j 

subtrees whose size are no more than to.. 


tree 
+ 1 


Algorithm 4 Tree partitioning 
Input: A tree T = (V. E), a positive integer m 

Output: Treeset where \J T . eTrees et T i = T 
l: Start with Treeset = 0; 

2: while Can find a node v in tree T - (V, E ) such that each 
child d of v has \Td\ < m and \T V \ > m do 
3: Rank these n children of v such that \T,n | > | Tdi | > 

••• > \Tdn[- 

4: Temptree = |7di|; 

5: for i = 2 to n do 

6: if | Temptree U T di \ + 1 < m then 

7: Temptree = Temptree UTdP, 

8: end if 

9: end for 

10: Remove Temptree and the edge adjacent to v from T; 

11: Add v and edges from v to its children in Temptree 

to Temptree ; 

12 : Add Temptree to Treeset ; 

13: end while 
14: if T ^ 0 then 

15: Add T to Treeset ; 

16: end if 

17: Return Treeset', 


Since p is the number of trees in Treeset. At least 
\Treeset\ — 1 trees have a size in the range of [^,to], and 
for the last one, the tree size belongs to [l,m]. Therefore, we 
have k > ^ (p — 1) + 1. Since p is the largest integer which 


satisfies this inequality, thus p = 


2(fc— 1) 


+ 1 . 


Baseline Algorithms 

The pseudocodes of the three baseline algorithms we dis- 
cussed in this paper are listed as follows. 


Algorithm 5 SetCover (SC) 

Input: A connected non-trivial graph G = (V. E ) 

Output: A set B 

1: Select a vertex v £ V, and let B = {v} 

2: If V — (B U N{B)) = (j>, then Stop 
3: Select a vertex w £ V — (B U N(B)), and assign B 3— 
B U {u>}. Go to Step 2 


Algorithm 6 IXP-Based (IXPB) 

Input: Degree threshold D t , a connected non-trivial graph 
G = (V, E), where V IX p, Vas € V, V IX p n Vas = 4> ■ 
Output: A set B consists of only Vj X p that dominates some 
other vertices, yet only partially 
1: For a vertex v £ Vi X p , assign B B U {r>} if its 
degree > D t 


Algorithm 7 Degree-based (DB) 

Input: A connected non-trivial graph G = (V,E\ degree 
threshold D t 

Output: A set B that dominates some other vertices, yet only 
partially 

1: For a vertex v £ V, assign B B U {v} if its degree 
> D t . 


Algorithm 8 PageRank-based (PRB) 

Input: A connected non-trivial graph G = ( V , E), PageRank 
value threshold 6 

Output: A set B that dominates some other vertices, yet only 
partially 

1: For a vertex v £ V, assign B ■£- f?U{w} if its PageRank 
val ue > (1 


Example for the Proposed Stackelberg Game 

To further explore how to achieve a large value of ai, we 
consider a simple example assuming homogeneous ASes Vi £ 
B. Set: 

Vi (a.i) = k\og (a, ; + l-a 0 ) , n4 . 

Pi (aj) = l-af+2 (aj-l)xpo, 

where k represents the increasing speed of revenue charged 
from end users for QoS improvement (which thanks to the 
increase of a,), and po represents the shape of P, ( cti ). In 
particular, we can see that when p (J is larger, the value of 
Pi(-) is smaller in [ao, 1], indicating that it might be a lower- 
tier ISP. 

We also set: 

C(a(p B ),p e (p B )) = Na{jc+(l-i) [t?c+(l-t?) p e ]},(15) 
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where N is the hop count factor, which represents the length 
of a path segment in B including hops between broker and 
employee, and also hops between broker and broker; 7 is the 
percentage of E2E connectivity via B solely without rerouting 
traffic to B as shown in Fig. 4(a), which is also the probability 
that of not needing to hire employee ASes; when there is a 
need to reroute to B, 77 represents the proportion of employee 
ASes in the path segment, and 77 <1/2 even if any two ASes 
in B are not connected. Here we assume a = a, \B\. 



Fig. 6. 


7 v 

(c) Impact of 7 on a* (d) Impact of 77 on oti 

Factors impacting the adoption of our routing scheme by ASes. 


Without loss of generality, let N\B\ = 100, op = 0.2, 
/? = 4, and c = 0.2. In Fig. 6 (a), we fix p 0 , 7 and 77 , and vary 
k to see the impact of k on a*. We can see that for AS with 
a larger k (i.e., higher revenue increase from users), it will 
route more traffic into the broker set. We use similar method 
to obtain the impact of po, 7 and 77 on a,. Fig. 6 (b) shows 
that if p 0 is larger (i.e., a lower-tier ISP), 0 , approaches 1 and 
the AS will be more willing to follow the new rule. Fig. 6 (c) 
and 6 (d) show that both 7 and 77 will not bring much influence 
to a, . If k and pa are large enough to achieve the threshold 
for cti approximate 1, say k > 5.5 and pi, > 0.35 in the case 
illustrated in Fig. 6 (a) and Fig. 6 (b), o:, remains as 1 when 
we vary 7 and 77 . 

Remark. From the above example, we gain some important 
insights. First, a critical condition to ensure that the ASes 
in B follow the new rule is that the QoS improvement by 
the new scheme is significant, so that such ASes can charge 
higher prices to end users and other ASes. Second, ASes may 
have different attitudes towards the new scheme. In particular, 
lower-tier ISPs may be willing to follow the new rule, but 
higher-tier ISPs may feel differently. One possible way to 
solving this problem is to include such higher-tier ISPs into 
the broker set. 



